Skip to content

Bootcamp/Euclid (Ephraim): Week 3 - Added an AI dataset generator with multiple providers#2052

Open
edfolmi wants to merge 1 commit intoed-donner:mainfrom
edfolmi:contribution/ai-dataset-generator-new
Open

Bootcamp/Euclid (Ephraim): Week 3 - Added an AI dataset generator with multiple providers#2052
edfolmi wants to merge 1 commit intoed-donner:mainfrom
edfolmi:contribution/ai-dataset-generator-new

Conversation

@edfolmi
Copy link
Contributor

@edfolmi edfolmi commented Mar 5, 2026

I built an AI dataset generator with Gradio UI. It uses large language models (LLMs) to generate structured synthetic data based on user input.

Features:

  • Generate any type of dataset, and download it in CSV format.
  • Specify the number of rows to generate.
  • Ability to select any provider and model of your choice. e.g HuggingFace, OpenRouter, OpenAI and Ollama, with any of their hosted models.

Gradio UI Interface Screenshot:
image

An example of a generated dataset in CSV Screenshot:
image

Screen recording:
https://drive.google.com/file/d/1-qfdLjOu9XIriVjuZvc4yHpsv1pTri_e/view?usp=sharing

@ranskills

@ranskills
Copy link
Contributor

Thanks, @edfolmi

Open the PR

@edfolmi edfolmi marked this pull request as ready for review March 5, 2026 10:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants